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A BERRY ESSEEN THEOREM FOR SAMPLE QUANTILES 
UNDER WEAK DEPENDENCE^ 

By S. N. Lahiri and S. Sun 

Texas A&M University and Wright State University 

This paper proves a Berry-Esseen theorem for sample quantiles 
of strongly-mixing random variables under a polynomial mixing rate. 
The rate of normal approximation is shown to be 0{n~^^^) as n — > oo, 
where n denotes the sample size. This result is in sharp contrast to 
the case of the sample mean of strongly-mixing random variables 
where the rate 0{n~^^^) is not known even under an exponential 
strong mixing rate. The main result of the paper has applications 
in finance and econometrics as financial time series data often are 
heavy-tailed and quantile based methods play an important role in 
various problems in finance, including hedging and risk management. 

1. Introduction. Sample quantiles of time series data play an important 
role in robust statistical inference about various process parameters, partic- 
ularly when the underlying distribution is heavy-tailed or when presence of 
outliers is suspected [cf. Butter, Filzmoser, Gather and Rousseeuw (2003)]. 
Although asymptotic normality of the sample quantiles under dependence is 
known, accuracy of the corresponding normal approximation has remained 
largely unexplored. In this paper, we establish a Berry-Esseen theorem for 
the sample quantile with the optimal rate 0(n~^/^) for a large class of 
weakly dependent time series. Apart from its foundational role in statisti- 
cal inference for time series data, the Berry-Esseen result of the paper also 
has important applications in finance and econometrics. It is well known [cf. 
Mittnik and Rachev (2001)] that financial time series data often are heavy- 
tailed. As a result, quantile based methods are being increasingly developed 
and employed in diverse problems in finance, such as, quantile-hedging [cf. 
Follmer and Leukert (1999)], optimal portfolio allocation [cf. Dmitrasinovic- 
Vidovic and Ware (2006)], risk management [cf. Melnikov and Romaniuk 
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(2006)], and so forth. The recent surge in interest in using quantile based 
methods in finance and related areas cahs for a better understanding of the 
theoretical properties of the sample quantiles of time series data in greater 
generality. The main result of the paper takes a step in this direction and 
establishes the optimal rate in the Berry-Esseen theorem for a large class of 
weakly dependent processes that require a polynomial strong mixing condi- 
tion. 

To describe the result of the paper, let {Xi^i^z be a sequence of stationary 
random variables with strong mixing coefficient ax{n) = sup{|i-*(yl r\B) — 
P{A)P{B)\:A£ T^,B e T^^^,k£ Z}, where TI = a{Xi:i E [0,6] n Z) is 
the (T-field generated by {Xi : z € [a, 6] n Z} , — 00 < a < 6 < 00 and where Z = 
{0, ±1, ±2, . . .} denotes the set of all integers. Let F denote the distribution 
function (d.f.) of Xi, that is, F{x) = P{Xi <x),x£ R. For p G (0, 1), let 

(1.1) F~\p)=mf{x:F{x)>p} 

denote the pth quantile of F. An estimator of the population parameter 
F~^{p) is given by the sample pth quantile 

(1.2) F-i(p)=inf{x:F„(x)>ri, 

where Fn{x) = Yll=i < x), x G M, denotes the empirical distribution 
function (e.d.f.) of Xi, . . . and where / denotes the indicator function, 
with I{S) = or 1 according as the statement S is false or true. When 
the process {Xi\i^i is strongly mixing at a (suitable) polynomial rate [i.e., 
oix{n) = 0{n~°-) as n ^ 00 for some suitable a G (0, 00)] and F is differen- 
tiable at F~^{p) with a positive derivative f{F~^{p)) > 0, it is known [see, 
e.g., Sen (1972), Sun and Lahiri (2006)] that 

(1.3) V^{f-Hp) - f-Hp)) -^'^n(o, jl^^^) 

as n — > 00, where cr'^{x) = J2iez Cov(/(Xi < x),I{Xi^i < x)), x G M. 

The main result of this paper refines (1.3) by specifying the rate of normal 
approximation to the distribution of ^/n{F~^ (p) — F~^{p)). More precisely, 
it is shown that if the process {Xi}i^z is strongly mixing at a certain poly- 
nomial rate and if the regularity conditions set forth in Section 2 hold, then 

snp\PiV^iF;;Hp) - F^Hp)) < ^) - ^(^/roo(p))| = 0(n-V2) 

(1.4) 

as n ^ 00, 

where r^(p) = cr^(F~^(p))//^(F~^(p)), and where $ denotes the d.f. of 
a standard normal variate. Thus, the Berry-Esseen theorem holds for the 
sample quantile of strongly mixing random variables under the conditions 
of Section 2. This is in marked contrast to the case of the sample mean of 
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strongly mixing random variables, where a Berry-Esseen theorem with the 
rate 0(n~^/^) of normal approximation is not available. The best known 
rate for sums of strongly mixing random variables with an exponentially 
decaying mixing coefficient is only 0(n~^/^ (log n)'^) for some suitable c > 
[cf. Tikhomirov (1980), Dasgupta (1988)]. The rate ©(n'^/^) fgj. ^j^g sample 
mean is available either when the X-process satisfies certain stronger forms 
of dependence conditions, like (/)-mixing [cf. Donkhan (1994)] or when the 
distance between the two probability distributions is measured using certain 
smooth metrics. Rio (1996) obtained the 0(n~^/^) rate for the sample mean 
under a suitable uniform mixing condition, which is known to be stronger 
than strong mixing [cf. Donkhan (1994)]. For certain smooth metrics, Utev 
(1991) established the same optimal rate for scaled sums of Banach space 
valued random elements under (/>-mixing. For strongly mixing random vec- 
tors, the 0(?i~^/^) rate for the sample mean (under suitable smooth metrics) 
follow from the results of Gotze and Hipp (1983) and Lahiri (1993) under 
an exponential mixing rate, and from those of Lahiri (1996) under a poly- 
nomial mixing rate. Recently, Bentkus and Sunklodas (2007) and Sunklodas 
(2007) have established the 0(n~^/^) rate for the sample mean under differ- 
ent smooth metrics, for both strongly mixing random variables and strongly 
mixing random fields. However, for most statistical applications, approxi- 
mations to the distribution function is needed and the best known rate for 
approximation to the distribution function of the sample mean for strongly 
mixing random variables is still 0(n~^/^(logn)'^) for some c > 0. 

Although the validity of the Berry-Esseen theorem with rate 0(n~^/^) for 
the sample mean of strongly mixing random variables remains unsolved, the 
main result of this paper establishes the desired optimal rate 0{n~^^^) for 
the sample quantiles in the strong mixing case, requiring only a polynomial 
decay of the mixing coefficient. In particular, the Berry-Esseen theorem of 
the paper extends the results of Reiss (1974) who establishes the 0(n~^/^) 
rate of normal approximation to the distributions of sample quantiles un- 
der independence. The proof of the main result here makes use of some 
arguments developed by Gotze and Hipp (1983) and Lahiri (1993, 1996) for 
deriving Edgeworth expansions for sums of strongly mixing random variables 
and also crucially exploits properties of the probability integral transform 
F~^(-) of the e.d.f. F^. We also consider some important classes of dependent 
processes and verify the regularity conditions used in the paper. 

The rest of the paper is organized as follows. In Section 2, we state the 
conditions, verify these for different classes of weakly dependent processes 
and state the main result. The proof of the main result along with some 
auxiliary lemmas are given in Section 3. 



4 



S. N. LAHIRI AND S. SUN 



2. Main results. 

2.1. Conditions. We prove the Berry-Esseen bound under a general frame- 
work introduced by Gotze and Hipp (1983) in their seminal paper on asymp- 
totic expansions for sums of weakly dependent random vectors. Suppose 
that the random variables {Xi : i G Z} are defined on a probability space 
(0, P) and that {T>i : i € Z} is a collection of sub-cr-fields of For — oo < 
a <b < oo, let = a{{Vi : i G [a, 6] n Z}) denote the smallest cj-field con- 
taining {Vi,a < i <b,i e Z}. Also, let = F~^{p). RecaU that cr'^{x) = 
EigzCov(I(Xi <x),I{Xi+i <x)), xGM. Let N = {1,2,...} denote the set 
of all positive integers. We shall make use of the following conditions: 

(C.l) (i) F is differentiable at with derivative /(^p) G (0,oo). 

(ii) fT^(ep)G(0,oo). 
(C.2) There exist constants d G (0, 1) and ao > 12 such that for all n > 1, 

a{n) = sup{\P{A n B) - P{A)P{B)\ :A€Vl^,Be i G Z} 

(2.1) 

(C.3) There exist constants d£ (0,1) and /?o > 12 and random variables 
i G Z, n G N such that xt„ is P-l^-measurable and 

(2.2) /3{n) = E\Xi - Xjj < d-^n'^^" for ah i G Z, n G N. 

(C.4) There exist constants d G (0, 1) and 70 > 12 such that for all m,n,r € 
N and A G V;+Z 

(2.3) |-P(^|^j : j^r)- P{A\Xj:0<\r-j\ <m + n)\ <d"^n"^°. 

(C.5) There exist a constant d G (0, 1) and sub-a-fields Ci,i G Z, of such 
that for every z G Z, a{'Dj : j 7^ i) U o-{{Xj CCi and 

(2.4) PiG^{^p) = l)<p-d 

where G,{y) = P{X, < y\Ci), y G M. 

We now comment on the conditions. Condition (C.l) is a standard condi- 
tion that is frequently used to ensure a nondegenerate limit distribution of 
the pth sample quantile under dependence. In the independent case, (C.l)(i) 
is also known to be necessary; see Lahiri (1992). Conditions (C.2)-(C.4) are 
similar to the conditions introduced in Gotze and Hipp (1983) for deriv- 
ing asymptotic expansion for sums of weakly dependent random vectors, 
where the right-hand sides of (2.1)-(2.3) were assumed to be exponentially 
decaying functions of n. The reduction to the polynomial rate here heavily 
relies on Lahiri (1996) which extends Gotze and Hipp's (1983) results al- 
lowing polynomial decay of the coefficients in (2.1)-(2.3). Condition (C.2) 
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is a strong mixing condition on the auxiliary cr-fields T>j^s, which together 
with condition (C.3), imposes an approximate strong-mixing structure to the 
given random variables {Xjjjgz- Condition (C.4) is an approximate Markov 
condition and in particular, it is satisfied if {Xjjjgz is an mth order Markov 
process for a fixed m G N. Condition (C.5) is a key regularity condition that 
perhaps needs some detailed discussion. To get some insight into condition 
(C.5), first suppose that the Xj's are independent. In this case, if we take 
Vj = a{Xj),j E Z, and Cj = a{{Xi : i ^ j}), then Gi{Cp) = P{Xi < ip\Ci) = 
P{X\ <£,p) =p, making the probability on the left-hand side of (2.4) zero, 
and hence, condition (C.5) easily holds. For the dependent case, however, 
Gi{^p) is a random quantity. In this case, if P{Gi{S,p) = 1) =p, then one can 
show that the conditional distribution of I{Xi < ^p) given Ci is degenerate 
at 1 on a set of probability p while it is degenerate at on the complemen- 
tary set of probability q = 1 — p. As a result, the conditional characteristic 
function of I{Xi < ^p) given Ci becomes identically equal to 1 in absolute 
value on all of $7, and our bound on the factorized conditional characteristic 
function of the scaled sum n~^/'^jyj=i{H-^i ^ ^p) ~ ^i^p)) '^o longer pro- 
vides a useful estimate for the discrepancy between the d.f.s of the sample 
quantile and the limiting normal distribution. However, once this degener- 
acy is ruled out by condition (C.5), it is possible to derive a suitably small 
upper bound on the conditional characteristic function of the scaled sums 
n~^^'^ J2]j=i{^ i-^i ^y) ~ P{y)) uniformly over y in a neighborhood of ^p (cf. 
Lemma 3.3 in Section 3). We exploit some basic properties of the quan- 
tile function in conjunction with this bound to establish the 0(n~^/^)-order 
bound for the sample quantiles. 

As in Gotze and Hipp (1983), the cr-fields I'j's and Cj's are introduced 
to add fiexibility in the verification of conditions (C.2)-(C.5). Below we 
consider some important examples and choose the cr-fields T>j's and Cj's 
suitably to show that condition (C.5) is quite unrestrictive. 

2.2. Examples. 

Example 2.1. Suppose {Xi}i^z is m-dependent for some m G ZU {0}, 
that is, cr{{Xi : i < k}) and cr{{Xi : i > k + m + 1}) are independent for all 
k gTj. Then we take T>j = cr{Xj) and Cj = (j{Xi -.i ^ j) , j £ 1^. Then it is easy 
to check that conditions (C.2)-(C.4) hold with = Xi for alH G Z, m e N 
and with aoi/3o)7o arbitrarily large. Furthermore, in this case, condition 
(C.5) reduces to 

(2.5) P{P{Xo < ^p\Xi : < |i| < m) = 1) < p. 

Let Go denote the conditional distribution of Xq given {Xi:0 < \i\ < m}. 
Suppose that there exist a set A£ with P{A) > and real numbers e, o, b 
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with e £ (0,1/2), a < < b such that Go puts at least e mass on (a — e, a] 
and on (b, b + e] on the set A, that is, if 

(2.6) Go{{a- e,a])> e, Go{{b,b + e]) > e for all w e ^. 

We claim that (2.5) holds if (2.6) holds. To see this, note that (writing Go 
also to denote the distribution function), 

p = F{Cp) = EGo{Q 

(2.7) 

= P{Go{Q = 1) + EGo{Ql{0 < G(ep) < 1) 

so that P(Go(Cp) = 1) < P- If possible, now suppose that (2.5) does not hold, 
that is, p = P{Go{Q = 1). Then by (2.7), EGoi^p)I{0 < Goi^p) < 1) = 0, 
which implies that P{Go{^p) G (0, 1)) = 0. Consequently, 

PiGoiQ = 0) = 1 - [PiGoiQ G (0, 1)) + P{Go{Cp) = l)] = l-p. 

But by the monotonicity of Gq, 

PiA) = PiA n {GoiQ = 0}) + P{A n {Goi^p) = 1}) 

< P({Go((a - e, a]) > e} n {Go{Q = 0}) 

+ P({Go(6, b + e])>e}n {Go{^p) = 1}) 

= P(0) + P(0) = O, 

which contradicts the fact that P{A) > 0. Hence, the claim is proved. 

Example 2.2. Let {Yi}i^z be a stationary homogeneous Markov pro- 
cess with transition probability function P{-;-) and stationary distribution 
u. Let Xi = H{Yi),i G N, where H \s a, Borel measurable function. Suppose 
that 

(2.8) \P{x;A)-P{y;A)\<l 

for all G M and ^ G S(M), the Borel cj-field on R. Then by (iii) on 
page 219 of Gotze and Hipp (1983), conditions (C.2)-(C.4) hold with Vj = 
a{Yj) and xt^ = Xj for all men,j el. where (2.1)-(2.3) are satisfied 
with arbitrarily large positive real numbers aO)/3o)7o- For condition (C.5), 
we take Cj = <y{{Yi : i ^ j}),j G Z. Next, suppose that there exists a cj-finite 
measure /x such that P{x; ■) <^ fi for all a; G M and v <^ fi. Write fo{x) and 
fi{x,y), respectively, for the density of and P{x; •) with respect to /i. Also, 
let A = H~^{{—oo,^p]). Suppose that there exist sets Ai C A and A2 C A'^ 
with /i(Aj) > for i = l,2 such that 

(2.9) fo{x)>0, /i(x,y)>0 for all x, y G U ^2- 
We claim that condition (C.5) holds under (2.9). 
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To prove the claim, first note that the conditional distribution of Yq given 
{Yi : i / 0} is given by 

(2.10) P{YoeB\Yi:i^O)= f go{Y_l,Yl■,y)^iidy), Be 

Jb 

where 

9o{y-i,yi;yo) 

/i(y-i,yo)/i(yo,yi) 



if the denominator is positive, 



Iufiiy~i,y)fiiy,yi)fj'idy)' 

0, otherwise. 
Relation (2.10) can be easily established by verifying the integral equation 

/ P{YoeB\Yr.i^O)dP = P{Dkn{YoeB}) 

for all B G B{M.) and for all sets Dk of the form = no<|j|<fc{-^ ^ ^i} ^'^^ 
ken, where Bi G B{R) for all i. 

Next note that by (2.9) and (2.10), for all (y_i,yi) G x A2, 

P{Yo G A\y^i,yi) > P{Yo G Ai\y^i,yi) > 0, 
P{Yo G A'\y^i,yi) > P{Yo G A2|y_i,yi) > 0. 

Consequently, GoiQ = P{Xq < Cp\Yi, i ^ 0) = P{Yq G A\Y,,i / 0) = ^(^0 G 
^|y_i,yi) G (0,1) for all {Y_i,Yi) G ^1 x^2- Since by (2.9), P{Y_i G Ai,Yi G 
A2) > 0, by the identity given in (2.7), P{Go{(,p) = 1) <p. Hence, condition 
(C.5) follows. 

Example 2.3. Let {Yj}j^z be a stationary zero- mean unit variance 
Gaussian process with spectral density /(A) and let Xj = Ho{Yj),j G Z 
for some Borel measurable function Hq : M ^ R. For this example, we set 
= cr{Yj) and = Xj for all j G Z, m > 1. Then it is clear that con- 

dition (C.3) holds with an arbitrarily large /?o G (0,oo). Next, note that by 
Theorem V.6.8 of Ibragimov and Rozanov (1978), the strong mixing coef- 
ficient a(-) of the Gaussian process {lijjgz satisfies a{n) = 0{n~^'^~^) as 
n ^ 00 for some 6 G (0, 1) if and only if /(A) is of the form 

(2.11) /(A) = \piexpiiX))\^wiX), A G (-vr, n] 

where p{z) is a polynomial with zeros on the unit circle {\z\ = 1} and where 
w{\) is a function that is bounded away from zero and is 12-times differen- 
tiable such that the 12th derivative satisfies a Holder's condition of order 5. 
Thus, condition (C.2) holds under (2.11). Also, note that by the arguments 
on page 220 of Gotze and Hipp (1983), condition (C.4) holds with a 70 > 12. 
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To verify condition (C.5), we take Cj = a{{Yi:i ^ j}). The arguments in 
Example 2.1 imply that condition (C.5) is not true if and only if 

(2.12) P{Go{^p) = l)=p and P{Go{^p) = 0) = 1 - p, 

where Go(Cp) = P{Xo < ^p\Yi:i / 0) = P{Ho{Yo) < ^p\Yi:i / 0). Since the 
conditional distribution function of Yq given (1^ : i 7^ 0) is normal and the 
sets Hq^{—oo,(,p] and Hq^{(^p, 00) both have positive probabilities under 
N(0, 1) [as P{Hq{Yo) < ^p) = p G (0, 1)], by the absolute continuity of nor- 
mal distributions, Go(^p) G (0,1) with probability one. Hence, (2.12) fails 
and, therefore, condition (C.5) holds. Thus, for the process {Xi}i^z of this 
example, the Berry-Esseen theorem for the sample quantiles holds solely un- 
der condition (C.l) and (2.11). Note that (2.11) requires only a polynomial 
decay of the autocovariance function of the Gaussian process {^ijiez- 

2.3. The Theorem. We now state the main result of the paper. 

Theorem. Suppose that conditions (C.1)-(C.5) hold. Then there exists 
a constant C £ (0,oo) such that for all n>l, 

sup|P(v^(4 -Cp)<x)-^x)\<-^ 

where ^„ = F~^{p) and ^p = F~^{p). 

Thus, under conditions (C.1)-(C.5), the rate of normal approximation to 
the distribution of normalized sample quantiles of strongly mixing random 
variables is 0(n~^/^). This rate agrees with the standard rate available in 
the case of normalized sample mean and sample quantiles of independent 
random variables. As mentioned earlier, the 0(n~^/^) bound under depen- 
dence is rather surprising, as a similar bound in the case of the sample 
mean of strongly mixing random variables still remains elusive, even under 
an exponential decay of the mixing coefficient a{n) of (2.1). 

The above theorem also extends the result of Reiss (1974) on the rate of 
normal approximation to the distribution of the sample quantiles of i.i.d. 
random variables, by allowing the random variables to be approximately 
strongly mixing. The method of proof employed here is very different from 
Reiss (1974) proof which heavily exploits the formula for the probability 
density functions of the sample quantiles of i.i.d. random variables. The 
same approach does not extend easily to the dependent case considered here 
as a similar formula for the density is not available for the general class 
of mixing processes. In contrast, our proof makes use of the characteristic 
function techniques of Gotze and Hipp (1983) and Lahiri (1993, 1996), and 
some uniform bounds on the behavior of characteristic functions of indicator 
variables in the neighborhood of the population quantile F^^{p), which may 
be of some independent interest. See Lemma 3.3 in Section 3. 
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3. Proofs. In the proofs below, we write C, C(-) to denote generic con- 
stants with values in (0,oo) that may depend on the arguments (if any), 
but not on the variables, n,x,y. Also, unless otherwise mentioned, we take 
limits by letting n — > oo. Let l = For any two real numbers x,y, let 

X Ay = min{x, y} and x V y = max{x, y}. 

By the definition of the sample quantile, for any y E M, 

(3.1) P{Fn{y) >p)< P{L <y)< P{Fn{y)>v)- 

Hence, we consider the sums X^iLi ^ u) for y in a neighborhood of 
and study the rate of convergence of the upper and the lower bounds in 
(3.1). The first result gives an expansion for the log-characteristic function 
of a scaled sum of a transformed sequence {/„(Xj)}jgz of random variables 
in a neighborhood of the origin. 

Lemma 3.1. For each n G N, /et : M ^ [— 1, 1] he a Borel measurable 
function such that Efn{Xi) = 0, E\fn{Xi) - fn{xli.)\ < Ck'^o for all i E Z, 
/c G N and 



(3.2) 



n 



''Varij2f,,{X,)]=l. 



Let Wni = fn{Xi),ie'L,n>l,Sn = n~^l^ ELi Hn{t) = Eexp{itSn),t e 
M. Also, let Xr,n denote the rth cumulant of Sn- Then for any e € (0, 1/4), 



sup 

teA„ 



log£'exp(itS'„ 



r=2 



<C{e)[ sup \Hn{t)[ 



n 



2e(aoVft)) 



{n 



-l/2-ao/4 



+ n 



/4} 



+ C(e)n-i/2-6^'( 1+ sup \ein{tt 



for alln > 1, where An = {t G M: |i| < (logn)-'^/^(loglog(n-|-l))^/'^} and where 
din (t) is as defined in ( 3. 7) below. 



Proof. For any random variables Vi, . . . , Vp, p £N, set 
(3.3) 



}Ct{Vi,...,Vp) 



d 



dx. 



log£'exp(itS'„ + xiVi H XpVp)\x^=...= 



d 

dxi <^u.p 

Then using Taylor's expansion of the cumulant generating function 
"log£'exp(itS'„)" around i = 0, we get 



(3.4) 



logii^exp(it5n 



1^ —rXr^n 



r=2 



(fc) 

k=0 
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for any t G M with \Eexp{LtSn)\ > 0, where ij = r]{t) £ [0, 1], Vj = tWnj/y/n 
and for a given k, the summation extends over ji,. . . ,jQ with maximal 
gap k. Note that by Lemma 3.1 of Lahiri (1996) (with c„ = 1, t = 0), for any 
oi, . . . , G M, with \aj\ < 1, r > 2, 

n-l 

\}CoiaiSn, . . . , arSn)\ < C7(r)n-('-2)/2 ^ fc--i[a(fc/3) + /3.(A;/3)] 

k=0 

(3.5) 

< C(r)n-(^-2)/2, 

provided oq > r, /Jq > where /J,,, (k) = /c"^" , A; € N. 

Next, fix e G (0,1/4) and let a.„ = Ji^/''"^ Then by (3.4) above and by 
Lemma 3.2 of Lahiri (1996) (with c„ = 1), as in the proof of his Lemma 3.6 
[cf. (3.9), op. cit.], 

an (k) 
k=0 

an 

<Y,n{k + lfCn-^{l + \tf) 

k=0 

(3.6) X {(1 + 9in{rit)f + (1 + 02n{vt)f} 
<C{e)a'^n-\l + \tf) 

X [1 + \9in{t)\^ + |i7„(t)r6{n-3"o/4 + n ■ n'^^*/"}], 
for all t € An, where 

(3.7) d,n{t) = — ^ max{|£;exp(5P)| : 1 < Z < L, |/| < 4, / C {1, . . . ,n}}, 

and where ^2n(t) = \Hn{t)\-^[L2^{a{m) + n/3,(m)} + {0("^)}^], Ct{k) = 
C\t\ X (n~^A;)^/^ [correcting for a typographical error in Lahiri (1996)] for 

A; G N, m = and L = loglogn. Here, for I C {1, . . . ,n},l > 0, sf^ = 

^j^-i/2^^*(0 where the summation ^*(') ranges over all j G {1, . . . ,n} 
such that \j — i\ > Ira for all i G /. Next using Lemma 3.3 of Lahiri (1996) 
with K = L,m = 2>Kn^^ and Cn = 1, for each k G {an,n), as in the proof of 
(3.10), page 217 of Lahiri (1996) [correcting for the typo, where (1 + ||t|r/^) 
is replaced with (1 + O?^"^'^^], we get 

n-l (fc) 
k=an+l 

- n^\Hni7lt)f 
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e\L 



(3.8) 



<C{e){l + \tf)\H4vt)\-' 
xL2^{n="«a(f-"«)+n-n^ 



n 



£L/4| 
ao/4 



< C{e) ■ \Hn{rit)\-^ ■ n2=(°o^^o) • n'^/^ ■ {n'^o/^ + n ■ n"^"/^} 
for all t G An- Hence, the lemma follows from (3.4) and (3.6)-(3.8). □ 

Remark. It is possible to obtain a bound on the difference between 
E exp{itSn) and its sth order Taylor expansion J2r=2 ^7T~Xr,n for an integer 
s > 3 by suitably modifying the arguments in the proof of Lemma 3.1. It 
can be shown that for a small 5 > 0, a bound of the order 0{n~^^'^~^) on 
the difference is assured, if the strong-mixing exponent satisfies 



(3.9) 



ao > s + 4 + 9/(s -2). 



By minimizing the right-hand side of (3.9), we get 5 = 5, which explains the 
reason behind considering the 5th order Taylor's expansion in Lemma 3.1. 



Lemma 3.2. Let Wnj 's and Sn be as in Lemma 3.1. 

(i) Then for any a G (0, 1/2), there exists a constant Cq = Co(ao, /3oi 70) «) 
such that for all n>Co, 



\Hn{t)\<Co 



exp 



-t 

l-a I 



2 r 



1 



Cn 



(logn) 



+ + n-^/^o + n-''^o}(logn)^« 



uniformly in \t\ < n(^~")/^(logn). 

(ii) There exist eq G (0, 1) and Ci = Ci{aQ, /3o) 7o) £ (0) co) such that for 
all n>Ci, 

\Hn{t)\ > eoexp(-tV2) - Ci ■ (logn)^i + „i/4 . „-/3o/2 ^ ^-70/2] 

for all \t\ < Eq logn. 

Proof. Let m = mi(logn)~^ and mi = n'^(logn)~^. Let l,ji,...,ji be 
the integers defined on page 218 of Lahiri (1996) with / = {1, . . . , n} and Ii = 
{mi + 1, . . .,n-mi}. Also, let Tk = Yl{exp{LtWnj / y/n) : j G /, \j-jk\ < mi}, 
k = 1, . . . ,1, and B = Yl{exp{LtWnj / \/n) :j £ L, \ j — jk\ > nii for all k = 
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1, . . . ,1}. Then using the arguments leading to (3.11) of Lahiri (1996) (with 
Cn = l,R= 1), for all n > 1, t E R, we get 



\Hn{t)\ 



(3.10) 



<CY[E\E{rk\Vj:j^ 



Jk) 



k=l 



+ C[Za(m) + Z7(m) +/3*(m){m + n^/2|t|}]. 
Next, note that by (3.2) and the stationarity of Xj's, 



m 



(3.11) 



j=mi+l 1 j=l 



^ {a(i/3) +/3,(j73)} + _^j{a(j/3) +/?.(i/3)} 

.j=mi+l J=l 



<c ^ ^ 

-j='mi+l 

<C(ao,/3o)K°"+' 



Hence, by (3.12) and the arguments following it on page 219 of Lahiri (1996), 
and by (3.10) and (3.11) above, it follows that for all n > C{ao,Po), 



Y[\E{rk\V,:j^ 



Jk) 



k=l 



(3.12) <Cexp 



< C7exp(^-^[1 - C7(a,ao,/3o)(logn)-2]^ 



for all \t\ < n^^ '^-'/^(logn), where in the second inequality, we have made 
use of the fact 

(3.13) 2mil = n[l — 0{n~^mi + mi^m)] as n ^ oo. 

Hence, part (i) of the lemma follows from (3.10) and (3.12). Part (ii) can be 
proved by retracing the arguments on pages 221-222 of Lahiri (1996), with 
Cn = 1. We omit the routine details. □ 

For the next lemma, let C be a sub-cj-algebra of J^, G{y; ■) = P{Xi < y\C), 
Ai{y) = {uj:G{y;uj) = 1} and ^2(2/) = {uj:0< G{y;uj) < l},y G M. Also, let 
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g{y) = P{{uj : G{y- uo) = 1}) = P{Ai {y)),y G M. Let ^>a{t) = (ae'* + 1 - a), t e 
M denote the characteristic function of a random variable Y with P{Y = 
0) = 1 -a,P(y = 1) = a,ae (0,1). 

Lemma 3.3. If g{^p) <P, then there exist 5,e£ (0,1) such that for all 
t £ R, 

sup E\E{eM'-tI{Xi < y))\C}\< 1 - (1 - |*.(t)|)<5. 

I?/-Spl<'5 

Proof. By definition, for all y G M, 

Fiy) = P{Xi <y)= E{P{Xi < y\C)} 

(3.14) 

= / G{y--)dP=l G{y;-)dP + g{y). 

Note that Ai{yi) C ^1(2/2) for all yi < 2/2 and that G(-;w) is a valid distri- 
bution function for each u; G 0. We claim that: 

(i) g{-) is nondecreasing, and 

(ii) g{-) is right continuous on M. 

The first assertion is immediate. To prove (ii), note that for any sequence 
2/n i y G K, 

V : G{y; uj) = 1} C f] : G{yn] oj) = 1} 

n>l 

[by the monotonicity of G{y; •) in y] while the reverse inclusion follows from 
the right continuity of G{y;uj) in y for each lu. Since F{(,p) =p, by (3.14), 

(3.15) 9{Cp)<P- 

Now suppose that g{£,p) < p. Then by the right continuity oi g{-), there exists 
a 5q> such that 

(3.16) g{Cp + So)<p-5o. 

By (i), this implies that g{y) <p — 6o for all y < Cp + ^o- Since F is continuous 
at ^p, there exists a < 61 < 60 such that 

(3.17) F{^p-6i)>p-[5o/2]. 

Next, write A3{y;e) = {uj : e < G{y;uj) < l-e},A4 = {uj:G{y-uj) > l-e},eG 
(0, l),y e M. Note that A4^{y;e) [ Ai{y) as e j and ^4(1/; e) C ^4(2/ + /i;e) 
for all y GM,h> 0,e € (0, 1). In particular, for any 1/ G M, 

(3.18) hm/ G{y;-)dP=( G{y;-) dP = P{A^{y)) = g{y). 
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Hence, by (3.14) and (3.16)-(3.18), there exists < e < 6o/8 such that for 
aU y€ {Cp-Si,^p + 5i), 

P{A^{y;e))> j G{y;-)dP 

= jG{y;-)dP- j G{y;-)I{G{y;-)<e)dP 

(3.19) - j G{y--)I{G{y--)>l-e)dP 

> F{y) -e- jG{ip + 5i- ■)I{G{Cp + Si;-) >1 - e) dP 

>FiCp-6i)-e-[g{Cp + 6i) + e] 

>\p-6o/2]-2e-\p-6o] 

= So/4.. 

Next, writing ^e{t) = jee'* + 1 — e|,f G M, and G{y) = G{y; •) (for notational 
simpUcity), by (3.19), for y € {^p — Si,^p + 6i), we have 

E\E{exp{LtI{Xi<y))\C)\ 

= E\G{y)e'' + {l-G{y))\ 

= E\l-iGiy)il-Giy))sm\t/2)\'/' 

(3.20) < P{Ai{y;e))+ElA,^y,,) • |1 - 4G(y)(l - G{y)) sm\t/2)\'/^ 
< P{Al{y;e)) + |1 - e(l - e)s\n\t/2)\"^ P{A^{y;e)) 

= l-{l-\^S)\)P{A^{y;e)) 

<l-(l-|*,(t)|)<5o/4. □ 
Proof of the Theorem. First, we shall show that 

A;= sup \P{y/7i{in-ip)<x)-^x/TM)\ 
\x\<\ogn 

(3.21) 

= 0(n-V2). 

To prove this, we apply inequality (3.1) with y = Xn, where for x € R, we 
set Xn = ^p + n~^/^x. Let cr'^{x) = nVar(F„(x)) and Sn{x) = ^/n{Fn{x) — 
F{x))/an{x), a; G R. By the smoothing inequality [cf. Lemma 2, page 538 of 
Feller (1971)], 



An = sup 

|a;|<logn 



PiFniXn)<p)-'^ 



^{p- F{Xn)) 

an{x) 



BERRY-ESSEEN THEOREM UNDER DEPENDENCE 15 

(3.22) < sup SUp\P{SniXn)<y)-Hy)\ 
|a;|<logn i 



< sup 

|a;|<n'' logn 



1 rKy^ 2 O 

- / \E exp{LtSn{xn)) - e"* /^lltl"-^ dt + 



where k G (0, oo) is a constant (independent of x), to be specified later and 
where C G (0,oo) is a constant independent of n, x. 
Next note that 

supl ^ |Cov(/(Xi < x),I{Xj+i < x)) I : x E M I 

lj=n ) 

oo 

<C^[a(i/3)+/?(j73)) 
— > as n ^ CO. 

Since <7^('?p) > and F is continuous at ^p, by the above fact, there exists 
(5* G (0,5o) such that 

(3.23) hm^inf inf{a2(ep + x) : \x\ < 6,} > al{Cp)/2, 

where 6q is as in Lemma 3.3. Let Mp = {x:\x — ^p| < 5^,}, and let Xr,nix) 
denote the rth cumulant of Sn{x) = ^/n{Fn{x) — F{x)) / cjn{x),x ^J\fp. Then 
it is easy to check that for any i G Z, any x G M and any eo> 0, 

E\I{X,<x)-I{xl^<x)\ 

(3.24) <P{x-e<Xi<x + £) + P{\Xi - > e) 
< [F{x + £)-F{x-e)]+e~^E\X,-Xl^^\. 

Hence, by conditions (C.l) and (C.3), there exists C > and (5** G (0,(5*) 
such that for ah \x - ip\ < d^,^, and i G Z, A; G N [with e = k~f^° in (3.24)], 

(3.25) E\I{Xi <x)- I{XI^ <x)\< Ck-^". 

For notational simplicity, without loss of generality, we shall set 5=^, = 5^, . 
Also, let Wni{x) = [I{Xi <x)- F{x)]/anix),i G Z, n > 1, x G AAp. Then by 
(3.23), (3.25) and condition (C.3), {Wni{x) :i G Z}„>i satisfies the conditions 
of Lemma 3.1 uniformly in x £ Mp. By Lemmas 3.1 and 3.2(ii) above and 
the induction arguments used in the proof of Lemma 3.28 of Gotze and Hipp 
(1983), it follows that there exists ei = ei(ao;/?o) £ (0, 1/4) such that for all 
< e < ei. 



sup sup 

x&Afpfi<logn 



UtY 

log EeyiY>{itSn{x)) - ^ — —XrA^) 

r=2 
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< C{e)[n-^l^-^' + {eoexp(-(Vl^)V2)}^' 

(3.26) 

^ ^2£(aoV/3o)|^-l/2-«o/4 _^ ^-l/2-/3o/4|| 

<C7(e,eo)n-i/2-C(e) 

for all n > Ci, where Ci is as in Lemma 3.2(ii). By arguments in the proofs 
of Lemma 3.33 of Gotze and Hipp (1983) and of Lemma 9.7 of Bhattacharya 
and Range Rao (1976) and by (3.5) and (3.26), we have 



sup / \Ee^-p{itSn{x))-e-^^''^\\t\-^dt 



< sup 

x£j\fp Jfi<\ogn 



E exp(LtSn{x)) 



e-*'/'(l + E(^i)'(H)-^Xr,n(x)^ 



+ sup 

x^Mp Jt^<\ogn 

for all n> Ci . Since 

[ e~'"/'dt = o{n-y^), 

J i2>logn 

to show that (5„ of (3.22) is 0{n^^^^), it remains to show that 
(3.27) sup / \Eexp{LtSn{x))\\t\~^dt = 0{7i~^/^). 

x&AfpJilogn)^/2<\t\<Kn^/2 

To this end, we split the set of t-values in (3.27) into the sets Bin = {i ^ 
M: (logri)V2 < \t\ < and B2n = {t G Min'^/^^ < \t\ < KnV2}. Then us- 

ing Lemma 3.2(i) with Wnj = Wnj{x),x G A/^ with a = 1/8, we have 

sup / \Eexp{itSn{x))\\t\~^ dt 
xeJVp J Bi„ 

„„7/16 

<2Co / exp(-tV2) •exp(Cot^(logn)"V2)|ir^c?i 

J(logn)i/2 

+ 2Co / \t\-Ut]{lognf° -n- 

Vj(logn)i/2 / 
flogn 



^Co .„-l/2-l/8 



(3.28) < 2Co 



rlogn 

exp(Co/2) / exp{-t'^/2)\t\-^ dt 

J(logn)i/2 
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,7/16 



rn'' " 

/ exp{-t^/A)\t\-^dt 

J log n 



3g 

+ 2Co(logn)^°+in-V2-i/8 
= o(n-V2). 

Next, note that (1 — n)^/^ < 1 — for all < u < 1. Hence, there exists 
'^^o = '^o('5, e) G (0, oo), depending on S and e of Lemma 3.3 such that for 

1 - (1 - = (1 - + 5(1 - 4e(l - e)sin2(t/2))^/' 

<{l-6)+ 6{l - 2e(l - e) sin2(t/2)) 
<1-C{e,6)t'^. 

Also, note that for a bounded random variable y and cr-fields C ^2 C 
E{Y\Qi) = E{E{Y\Q2)\Gi} a-s. (P)- Hence, setting k = in -B2n, and using 
(3.10) (with a = 1/8), we have 

sup / \E ex.p{LtSn{x))\\t\~^ dt 

XGAfp J B2n 

I 

< sup / l[E\E{Tkix)\Vj:jy^jk)\\t\-^dt 
+ C'[n^~"(ri^"'^"''^*^'^")) • (logn)*-^'^"'''^''''^°^] 

< 21ogn • supj J] E\E{Tkix)\C)\:te B2n,xe AApj + o{n~^/^) 
(3.29) = 21ogn • sup{S|^(exp(it/(Xi < x)/^/^)\C)\:t e B2n,xe MpY 

< (2 logn) - sup {1 - C{e,6) ■ /nY + o{n-^/'^) 
= 0(logn • exp(-C(e, 6) ■ n~^/^ ■ I)) + o(n-i/2) 

where I = n/(2mi)(l + o(l)) = n^-''(logn)^(l + o(l)) (with a = 1/8) and 
where the variables Tk{x)^s are defined as in the proof of Lemma 3.2 with 
Wjk = Wjk{x),x GAfp. Hence, (3.27) follows and by (3.1) and the absolute 
continuity of the limiting normal distribution, (3.21) follows. 
Next, note that 

sup \P{Vniin-Cp)<x)-'^{x/Too{p))\ 
x<— logn 
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< PiV^iin -Q<- logn) + $(-logn/Too(:p)) 

(3.30) 

<A: + 2cI>(-logn/roo(p)) 
= 0(n-i/2) 

and similarly, 

(3.31) sup \P{V^{in - Cp) <x)- $(x/roo(p))| = 0(n^i/2). 

x>logn 

Hence, the theorem follows from (3.21), (3.30) and (3.31). □ 
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